two pass linker


代写完成–链接器 two pass linker


The target machine is word addressable and has a memory of 200 words, each consisting of 4 decimal digits. The first (leftmost) digit is the opcode, which is unchanged by the linker. The remaining three digits (called the address field) form either:

  • an absolute address, which is unchanged.
  • an immediate operand, which is unchanged.
  • a relative address, which is relocated.
  • an external address, which is resolved.
    Relocating relative addresses and resolving external references were discussed in class and are in the notes.

The input consists of a series of object modules, each of which contains three parts: definition list, use list, and program text. Preceding all the object modules is a single integer giving the number of modules present.

The linker processes the input twice (that is why it is called two-pass). Pass one determines the base address for each module and the absolute address for each external symbol, storing the later in the symbol table it produces. The first module has base address zero; the base address for module I + 1 is equal to the base address of module I plus the length of module I. The absolute address for a symbol S defined in module M is the base address of M plus the relative address of S within M. Pass two uses the base addresses and the symbol table computed in pass one to generate the actual output by relocating relative addresses and resolving external references.
The definition list is a count ND (Number of Definitions) followed by ND pairs (S, R) where S is the symbol being defined and R is the relative address to which the symbol refers. Pass one relocates R forming the absolute address A and stores the pair (S, A) in the symbol table.

The use list is a count NU (Number of Uses) followed by NU pairs (S, R), where S is an external symbol used in the module and R is a relative address where S is used. The (dummy) address initially in R is a pointer to the next use of S. This linked list of uses ends with a pointer of 777.

The program text consists of a count NT (Number of Text entries) followed by NT pairs (type, word), where word is a 4-digit instruction as described above and the type of the address component: 1=immediate, 2=absolute, 3=relative, and 4=external. NT is also the length of the module.

Other requirements

Your program must check the input for the errors listed below. All error messages produced must be informative, e.g., “Error: X21 was used but not defined. It has been given the value 111”.

1) if a symbol is defined but not used, print a warning message and continue.
2) if a symbol is multiply defined, print an error message and use the value given in the first definition.
3) if a symbol is used but not defined, print an error message and use the value zero.
4) if an address appearing in a definition exceeds the size of the module, print an error message and treat the address as 0 (relative).
5) if an immediate address (i.e., type 1) appears on a use list, print an error message and treat the address
as external (i.e., type 4).
6) if an external address is not on a use list, print an error message and treat it as an immediate address.
7) if an absolute address exceeds the size of the machine, print an error message and use the largest legal value.
You may need to set “arbitrary limits”, for example you may wish to limit the number of characters in a symbol to (say) 8. Any such limits should be clearly documented in the program and if the input fails to meet your limits, your program must print an error message. Naturally, any such limits must be large enough for all the inputs on the web. Under no circumstances should your program reference an array out of bounds, etc.

Your program must read an input set from standard input, i.e., directly from the keyboard. It is an error for you to require the input be in a file.